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ABSTRACT 

We study the clustering of galaxies in real and redshift space using the Optical Redshift 
Survey (ORS). We estimate the two point correlation function in redshift space, £,{s), 
for several subsamples of ORS, spanning nearly a factor of 30 in volume. We detect 
significant variations in ^(s) among the subsamples covering small volumes. For vol- 
umes ^ (75/i~^Mpc)'^, however, the ORS subsamples present very similar clustering 
patterns. Fits of the form^(s) = (^)~^° give best-fit values in the range 1.5 < 7s < 1.7 
, and 6.5 < sq < 8.8h^^ Mpc for several samples extending to redshifts of 8000 km s^^. 

' However, in several cases ^(s) is not well described by a single power-law, rendering 

I the best-fit values quite sensitive to the interval in s adopted. We find significant differ- 

■ ences in clustering between the diameter-limited and magnitude-limited ORS samples 

0^ ' within a radius of 4000 km s~^ centered on the Local Group; f (s) is larger for the 

magnitude-limited sample than for diameter-limited one. We interpret this as an indi- 
^ ^- rect result of the morphological segregation coupled with differences in morphological 

I ' mix. We split ORS into different morphological subsamples and confirm the existence 

2 . of morphological segregation of galaxies out to scales of s ^ lOh^^ Mpc. Our results 

indicate that the relative bias factor between early type galaxies and late-types may 
be weakly dependent on scale. If real, this would suggest non-linear biasing. 

We also compute correlations as a function of radial and projected separations, 
^(rp,7r), from which we derive the real space correlation function, ^(r). We obtain 
k> , values 4.9 < < 7.3h^^ Mpc and 1.5 < 7r < 1.7 for various ORS samples. As before, 

}_j ■ these values depend strongly on the range in r adopted for the fit. The results obtained 

' in real space confirm those found using ^(s), i.e. in small volumes, magnitude limited 

samples show larger clustering than do diameter limited ones. There is no difference 
when large volumes are considered. Our results prove to be robust to adoption of 
different estimators of ^(s) and to alternative methods to compensate for sampling 
selection effects. 

Key words: methods: statistical; galaxies: clustering; cosmology: large-scale struc- 
ture of Universe 



1 INTRODUCTION 

Early redshift surveys showed that galaxies are not dis- 
tributed randomly in the Universe (Oort 1983). They are 
found to lie in clusters, filaments, bubbles and sheet-like 
structures (Davis et al. 1982, de Lapparent, Geller & Huchra 
1986, Pellegrini et al. 1989). There exist large regions of the 



Universe which are almost devoid of galaxies (Kirshner et al. 
1981, Geller & Huchra 1988, de Lapparent, Geller & Huchra 
1988). This wealth of information about galaxy clustering 
and general distribution properties provide constraints on 
theories of galaxy and large-scale structure formation (Pee- 
bles 1980, Dekel & Rees 1987, Strauss & Willick 1995). 
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To characterize the distribution of galaxies and to quan- 
tify the degree to which it departs from a Poisson distri- 
bution, a number of statistical tools have been developed. 
The first and most widely used approach to quantify the 
degree of clustering in a galaxy sample has been the two- 
point correlation function (Totsuji & Kihara 1969, Peebles 
1980). In spite of its known limitations, it has been applied 
to all redshift surveys completed to date (e.g. Moore et al. 
1994, Martinez & Coles (1994) [IRAS QDOT], Dalton et al. 
1994 [APM Clusters], Loveday et al. 1995 [Stromlo- APM] , 
de Lapparent et al. 1988, Park et al. 1994 [CfA2], Fisher 
et al. 1994 [IRAS 1.2 Jy]). Similarly, higher-order clustering 
properties have been studied by means of the three and four- 
point correlation functions or by using counts in cells statis- 
tics (e.g. Efstathiou, Sutherland & Maddox 1990, Bouchet et 
al. 1993). Recent efforts have also been invested into directly 
reconstructing the galaxy density field (Saunders et al. 1991, 
Scharf et al. 1992, Strauss et al. 1992, Hudson 1993, Fisher 
et al. 1995b) or studying its power-spectrum (Vogeley et al. 
1992, Fisher et al. 1993, da Costa et al. 1994). 

A limiting factor in comparing observed galaxy clus- 
tering properties with model predictions is the difficulty in 
assessing whether the observations reflect the properties of 
the ensemble of galaxies in the universe (de Lapparent et al. 
1988). The search for a 'fair sample' of galaxies has led to 
the completion of ever larger redshift surveys, probing both 
depth and angular extent. A recent step towards this goal 
was the completion of the Optical Redshift Survey (ORS), 
covering 98% of the sky for |6| > 20° and containing about 
8500 optically selected galaxies (Santiago et al. 1995, 1996). 
ORS is currently the closest approximation to an all-sky 
redshift survey selected in the optical and provides a dense 
sampling of the galaxy distribution out to 8000 km s~^. 

The particular selection criteria used to define a red- 
shift sample may introduce systematic effects on the derived 
clustering properties of galaxies. Most samples are drawn 
from galaxy catalogues by applying either a diameter or a 
magnitude cut-off limit to them. Thus, it is important to 
establish if diameter-limited and magnitude-limited surveys 
sample the general galaxy population in a uniform way. In 
fact, comparisons between diameter and magnitude limited 
samples have been made by Zucca et al. (1991) but with 
much smaller samples. ORS is again an excellent dataset for 
such analyses, given that it is essentially complete in both 
diameters {9 > 1'.9) and magnitudes {tub < 14.5). 

Finally, an important observational result of previous 
galaxy clustering analyses is the existence of segregation as 
a function of morphological type. It has been qualitatively 
known for decades that elliptical or bulge-dominated ('early- 
type') galaxies prevail in the cores of rich clusters of galax- 
ies whereas disk-dominated galaxies (spirals) make up most 
of the general field population (Hubble & Humason 1931, 
Abell 1965). This result is often described as the so called 
morphology-density relation, first quantified by Dressier 
(1980) for clusters, and also known to apply for smaller 
galaxy concentrations such as groups (Postman & Geller 
1984, Ferguson & Sandage 1991). A morphology-radius rela- 
tion have also been suggested (Whitmore, Gilmore & Jones 
1993). Indications that morphological segregation also ex- 
ists among field galaxies have been given by several works 
(Davis & Geller 1976, Giovanelli, Haynes & Chincarini 1986, 
Eder et al. 1989, Lahav, Nemiroff & Piran 1990, Mo & 



Borner 1990, Santiago & Strauss 1992). While morpholog- 
ical segregation is fairly well established, at least on scales 
up to ~ Wh~^ Mpc {Ho = 100/i km s"^ Mpc"^), it has 
been more difficult to establish if the clustering properties 
of galaxies depend on luminosity (Phillipps & Shanks 1987, 
Alimi, Valls-Gabaud & Blanchard 1988, Davis et al. 1988, 
Hamilton 1988, Salzer, Hanson & Gavazzi 1990, lovino et al. 
1993, Park et al. 1994), surface brightness (Davis & Djor- 
govski 1985, Santiago & da Costa 1990, Thuan et al. 1991), 
spectral properties (Salzer et al. 1988) or circular velocity 
(White, TuUy & Davis 1988, Mo & Lahav 1993). On the 
theoretical side, the existence of segregation among galaxies 
and the indirect evidence for segregation between light and 
matter on galactic and extra-galactic scales has led to the 
formulation of scenarios of biased galaxy formation (Kaiser 
1984, Babul & White 1991). 

In this paper we address the issues of clustering de- 
pendence on survey volume, sample selection criteria and 
morphology. The paper is structured as follows. In section 
^ we briefiy introduce the ORS data. In section ^ the dif- 
ferent estimators of the two-point correlation function and 
their associated errors are presented. We also discuss dif- 
ferent schemes to compensate for selection effects inherent 
to the data. In section ^ we present results based on the 
comparison of the correlation functions in redshift space, 
^(s), for different ORS subsamples split according to vol- 
ume, selection criteria and morphological types. In section ^ 
we compute ^(rp,7r), the correlation as a function of radial 
and projected separations, and invert it to obtain the real- 
space correlation, (,{r). Our results for ^(r) are consistent 
with those of section ^. Finally, in section ^ we state our 
main conclusions and discuss our results. 



2 THE DATA 

ORS was presented by Santiago et al. 1995 (hereafter Paper 
I). In brief, the sample was drawn from three catalogues: 
the Uppsala General Catalogue of Galaxies (UGC; Nilson 
1973), the European Southern Observatory Galaxy Cata- 
logue (ESO; Lauberts 1982, Lauberts & Valentijn 1989) and 
the Extension to the Southern Galaxy Catalogue (ESGC; 
Corwin & Skiff 1994) . ORS contains two largely overlapping 
subsamples, one magnitude limited to ms < 14.5 (hereafter 
ORSm) and the other diameter limited to 9b > 1.9' (here- 
after ORSd). ORSd was created by merging ESGC, ESOd 
and UGCd, and ORSm consists of ESOm and UGCm. The 
entire ORS catalog consists of 8457 objects; redshifts are 
available for 8266. To complete the sample, roughly 1300 
new redshifts were measured - the remaining redshifts were 
taken from the literature. Most of the new redshifts were 
obtained in regions of low galactic latitude, 20° < |6| < 30°, 
and within the strip not covered by either the UGC or ESO 
catalogues ( — 17?5 < 5 < — 2?5). ORS provides the most 
detailed and homogeneous sampling to date of these areas 
of the sky. The matching of the catalogs and the derivation 
of the luminosity and diameter functions for the ORS are 
treated in Santiago et al. (1996) (hereafter Paper II). 
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Figure 1. The correlation function for ORSd (panel a) and ORSm (panel b) out to Vmax = 8000 km s ^ using the Hamilton (stars) 
and Standard estimator (diamonds). 



3 THE TWO-POINT CORRELATION 
FUNCTION 

Consider a Poissonian distribution of points in space. The 
joint probability that one point is found in each of two inde- 
pendent volume elements dVi and dV2 obviously scales with 
the mean density and the size of the volume elements 



dP = ndVidV2. 



(1) 



If the distribution is non-Poissonian then there may be an 
excess probability of finding a point in each of the two vol- 
ume elements dVi and dV2 separated by the distance r (see, 
e.g Peebles 1980): 



dP = n'^dVidV2[l + ^{r)]. 



(2) 



This excess probability, quantified by ^(r), is the two-point 
correlation function. 

Traditionally, ^(r) has been determined by the expres- 
sion 

nnDDij) 

where DD{r) and DR(r) are, respectively, the number 
of galaxy-galaxy and galaxy-random pairs with separation 
within the range r and r + dr. The latter are computed by 
creating a catalogue of randomly distributed points within 
the same volume occupied by the real galaxies, no (nR) is 
the number density of galaxies (random points) in this vol- 
ume. We hereafter refer to ^s(r) as the standard estimator. 
In spite of its widespread usage, the standard estimator ex- 
plicitly depends on the mean density assigned to the sample. 
As the lack of a fair sample may bias no, ^sir) may be (un- 
der) overestimated accordingly (de Lapparent et al. 1988, 
Davis et al. 1988). More recently, Hamilton (1993) has pro- 
posed an alternative estimator, which is much less sensitive 
to this problem: 



DD{r)RR{r) 



1, 



(4) 



[L»i?(r)]2 

where RR{r) is the random-random pair counts within the 
bin of separation r. Hamilton (1993) shows that the depen- 
dence of £,H{r) on the mean density of galaxies in the survey 
volume is of second order, whereas the dependence of (.sir) 



is of first order. Thus S^h (r) is less affected by the uncertainty 
in the mean density. 

In Fig. 1^, we compare the two estimators in redshift 
space, ^h{s) and ^s(s), for both ORSd and ORSm cut at 
Vmax ~ 8000 km s~^. Throughout this paper, we adopt the 
usual convention and use ^(s) {£,{r)) to denote the corre- 
lation function in redshift (real) space. The two estimators 
give very similar results in both cases except on the largest 
scales, where the clustering is weak. A similar trend was 
found by Loveday et al. (1995), who analysed the Stromlo- 
APM survey. Since (,h{s) gives a more reliable estimate of 
the two-point correlation function on all scales, we chose to 
use that estimator exclusively in what follows. 

In practice, the pair counts DD, DR and RR are 
weighted sums of pairs, which take into account the selection 
effects inherent to the galaxy sample used (see §3.1 below). 
Likewise for the mean number density no- The catalogue 
of random points may or may not imitate such selection 
effects, provided that the appropriate selection function is 
applied to compensate for them. In this paper we did not 
introduce any selection effects to the random points. The 
weighting scheme adopted in this work is the topic of the 
next subsection. 



3.1 Weights and selection functions 

In a diameter or magnitude limited survey, the sample will 
become sparser at larger redshifts due to the increasing loss 
of galaxies caused by the adopted apparent magnitude or 
diameter cut-off. This selection effect has to be correctly 
accounted for when determining ^(s) or any other statis- 
tic from the sample. The effect is quantified by the selection 
function, </'(r), which expresses the fraction of the total pop- 
ulation of galaxies that are expected to satisfy the sample's 
selection criterion at any given point in space. Under the as- 
sumption of a universal luminosity (or diameter) function, 
the selection function depends solely on distance. For ORS, 
however, an angular dependence is introduced by variable 
(and non-negligible) amounts of Galactic extinction as well 
as by non-uniformities in the magnitude or diameter sys- 
tems among the three different catalogues from which the 
data were derived. Paper II presents a detailed discussion of 
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Figure 2. Comparison of volume (stars) and velocity (diamonds) limited subsamples for ESOd (panel a) and UGCm (panel b). 



the problem and of the way these additional selection effects 
were incorporated into 0(r). In brief, extinction effects were 
incorporated into the maximum likelihood approach used to 
fit (j){r) (see also Yahil et al. 1991). A selection function was 
also fitted separately to each of the five ORS subsamples 
(ESOm, ESOd, UGCm, UGCd and ESGC) in order to by- 
pass the inhomogeneity in selection caused by the different 
catalogues. 

Once 0(r) is available, selection effects are in principle 
compensated for by weighting each galaxy i hy Wi = l/(j){ri). 
However, the different subsamples in ORS have different 
diameter and/or magnitude systems, which introduces an 
additional selection bias: subsamples based on deeper plate 
material tend to contain more galaxies and thus have a larger 
mean number density (Paper II). We thus normalize each 
subsample by the mean number density of galaxies in that 
subsample. This would be adequate if each subsample in it- 
self were a fair sample, but that it not necessarily the case, 
and thus, as explained in Paper 11, we renormalize again by 
the mean density of each subsample volume relative to the 
full sky, as found in the full-sky IRAS 1.2 Jy sample (Fisher 
et al. 1995a). We thus assign galgixy i a weight given by 

Wi = , (5) 

<p(ri)nk,ORS ntot.iRAS 

where nk,ORS {nk,iRAs) is the mean number density in the 
region corresponding to the fc*'' subsample in ORS (IRAS) 
and ntotjRAS is the total mean number density as com- 
puted from the IRAS 1.2 Jy sample. All number densities 
correspond to a volume-limiting distance of 500 km s^^ (cf.. 
Paper 11). 

The weighting scheme used here is not minimum vari- 
ance, in the sense of Davis & Huchra (1982) and Efstathiou 
(1988). However, minimum weighting requires prior knowl- 
edge of the correlation function. This problem has been by- 
passed in the past by usage of earlier estimates of ^(r) or of 
N-body simulations (Loveday et al. 1992,1995). For a dense 
sample like ORS, however, the practical improvement pro- 
vided by using a minimum variance method is very small 
(Hamilton 1993, Park et al. 1994). 

Finally, we should point out that the results presented 
in the next sections have been tested with different weighting 
schemes and prove to be robust. 



3.2 Error estimation 

There is no standard method of calculating errors for cor- 
relation functions (Hamilton 1993, Strauss & Willick 1995). 

One could simply assume Poisson fluctuations in the pair 
counts, but since they are correlated on any given scale, this 
is not strictly correct. Instead we use 10 bootstrap resam- 
plings of the data (Barrow, Bhavsar & Sonoda 1984, Ling, 
Frenk & Barrow 1986, Mo, Jing & Borner 1992) to esti- 
mate the variance in the calculated correlation functions. 
As shown by Fisher et al. (1994), this is likely to be an over- 
estimate of the real errors and is thus appropriate given the 
conservative approach we adopt throughout this paper. 



4 CLUSTERING IN ORS 

The ORS subsamples used here are limited at Vmax = 
8000 km s~^ in order to minimize shot noise. The selec- 
tion function parameters for those subsamples with Vmax = 
8000 km s^^ arc listed in Table 1 of Paper II. As mentioned 
before, to calculate ^(s) we created random samples with the 
same geometry as the data sample in consideration; in the 
case of ORSd, for example, we limited the random sample 
to 8000 km s"\ |6| > 20° and to regions with Ab < 0.7 as 
determined from the extinction maps of Burstein & Heilos 
(1982). When deahng with ESOm (and therefore ORSm) 
we also took into account the lack of ESO-LV magnitudes 
in several ESO plates and thus did not place random points 
within the boundaries of these plates (see Paper 1 for de- 
tails). 

We assigned unit weight to the random points and used 
logarithmic binning on all scales. We tested the code and 
our random rmmbcr generator by computing ^(s) on several 
pairs of random samples consisting of 20000 points each. As 
expected, this yielded no correlations on scales s > 0.5h~^ 
Mpc. As a final test of our code, we calculated {(s) for the 
IRAS 1.2 Jy sample (Fisher et al. 1995a) and successfully 
recovered the results of Fisher et al. (1994). 

An important issue is to test the reliability of the 
weights assigned to each galaxy. We do that by comparing 
subsamples cut at a given maximum distance Vmax (here- 
after referred to as velocity- limited at Vmax) to their volume- 
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Figure 3. Panel a: for ESGC (stars), ESOd (diamonds) and Figure 4. The same as Fig. |^ but for magnitude limited subsam- 

UGCd (triangles) cut at V^ax = 4000 km s'^ . Panel b: The pies, 
same, but for subsamples cut at Vmax = 8000 km s~^. 



limited counterparts. These latter subsamples, by definition, 
include only objects which are luminous (or large) enough 
to satisfy the selection criteria of the survey if they were 
at Vmax- This is equivalent to applying a lower limit to the 
luminosity (or absolute diameter) to the subsample, leading 
to a uniformly selected dataset with unit weight assigned 
to each object (<?!>(r) = 1, < r < Vmax)- Since both ve- 
locity and volume-limited subsamples probe the same vol- 
ume in space, they should lead to very similar ^(s), unless 
the weights in the velocity-limited subsample are biased or 
there is segregation as a function of luminosity (or absolute 
diameter) . The results are plotted in Fig. |^. Panel a shows 
ESOd out to Vmax ~ 4000 km s"^ and panel h refers to 
UGCm out to Vmax = 8000 km s~^. The agreement is very 
good in both panels, especially when one considers the rel- 
atively small number of galaxies in the volume limited sam- 
ples (386 and 433 objects in panels a and b, respectively). 
We are therefore confident that the selection function de- 
rived in Paper II is reliable and hereafter continue working 
with velocity-limited subsamples. 

Finally, we note that redshifts are a combination of the 
Hubble flow and peculiar velocities, the latter distorting the 
determination of galaxy clustering in redshift space. In the 
remainder of this section we present correlation functions for 
ORS in redshift space, ^(s). Since the main purpose of this 
work is to compare ^(s) for different samples, we adopt the 
working hypothesis that, to flrst order, peculiar velocities 



will affect the samples being compared in similar ways. In 
section ^ we calculate the real-space correlation function, 
and find similar results to those in this section, confirming 
our working hypothesis. 

4.1 ^(s) for the different subsamples 

If a survey sampling the local universe contains structures 
whose sizes are comparable to the volume of the survey it- 
self, it is not considered a 'fair sample' of the large-scale 
galaxy distribution. A detailed discussion of the effect of 
such survey-sized structures is presented by de Lapparent et 
al. (1988). Therefore, it is important to define the minimum 
survey volume which would satisfy the condition for being a 
fair sample. Since ORS provides us with a dense sampling of 
the local universe over two-thirds of the celestial sphere, we 
here investigate whether it may be considered a fair sam- 
ple, at least for the purpose of estimating ^(s). We thus 
compute ^(s) separately for the 5 ORS subsamples (ESOd, 
UGCd, ESGC, ESOm and UGCm) both out to Vmax = 4000 
km s~^ and Vmax = 8000 km s~^, with volumes spanning 
nearly a factor of 30. 

The results for the diameter-limited data are plotted in 
Fig. ^ and the corresponding plots for the magnitude limited 
subsamples are in Fig. ^. Errorbars denote one standard 
deviation from the distribution of ^(s) for the 10 bootstrap 
resamplings mentioned in section 3.2. For clarity, we have 
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only plotted errors for one of the subsamples in each of the 
two panels in Fig. ^ and Fig. ^. In panel ^ we plot errors 
for ESGC. It is the smallest of the ORS subsamples and 
correspondingly the one that has largest associated errors. 

The differences in ^(s) are significant for the subsamples 
shown in panels and ^a. There are large discrepancies be- 
tween UGC and ESO on scales 7 — 20h~^ Mpc regardless of 
selection criteria. ESGC seems to exhibit a slightly steeper 
slope and a much smaller correlation length than both the 
ESOd and UGCd samples. For Vmaoo = 8000 km s"\ how- 
ever, ^(s) is very similar for the ESO and UGC subsam- 
ples. This is true in both figures | and |. The ESGC C(s) 
still has a smaller amplitude at Vmax ~ 8000 km s~^ than 
do ESOd and UGCd. ESGC, however, covers a volume less 
than half of that of ESOd and less than a third of UGCd. 
We thus conclude that the ESOd and UGCd subsamples 
stretching out to Vmax = 8000 km s^^ are close approxima- 
tions to a fair sample (at least for the purpose of estimating 
^(s) within 20h~^ Mpc), whereas ESGC and the subsamples 
limited at Vmax = 4000 km s~^ are not. The minimum vol- 
ume that supposedly contains a representative sample of the 
ensemble, for these purposes, is thus ~ (75ft~^Mpc)'^. No- 
tice that in both ESO and UGC, diameter and magnitude- 
limited subsamples have a similar ^(s) slope. The latter, 
however, are more clustered for Vmax ~ 4000 km s~^ but 
not for Vmax = 8000 km s"\ This issue will be revisited in 
§4.2. 

For each of the subsamples large enough to be consid- 
ered good approximations to a fair sample we fit a power-law 
of the form 



as) 



(6) 



and determine the best-fit value of the slope 73 and correla- 
tion length So using a least squares fit. The scale ranges 
used in the fits and the derived values are listed in Ta- 
ble |l| The best-fit values span the range 1.5 < 7s < 1.7 
and 6.5 < so < 8.8h~^ Mpc. However, some of the cases 
shown in Fig. ^ and Fig. ^ are not well described by a 
single power-law over all scales. The presence of curvature 
in 5(s) is also visible in the APM (Loveday et al. 1995) 
and quite pronounced in the QDOT sample (Moore et al. 
1994). Moore et al. (1994) hypothesize that this is caused 
by a combination of peculiar velocities and observational 
errors which will enhance the correlations on small scales 
and suppress them on larger scales. Since the ^(s) are not 
perfect power-laws, the values listed in Table |l| are very sen- 
sitive to the scale interval used in the fits. For instance, if 
we fit a power-law to ^(s) for ORSd (ORSm) limited at 
Vmax = 8000 km s"^over the range 1 < s < 10h~^ Mpc, 
we find So = 8.19(8.58) and 7^ = -1.26(-1.18). If the fit- 
ted range is 3 < s < 16h~^ Mpc, however, the results are 
So = 7.36(7.65) and 7^ = -1.58(-1.62). The ranges listed in 
Table |l| are thus the ones where ^(s) was best fit by a single 
power-law. Also, due to the fact that the points are not sta- 
tistically independent, it is not strictly correct to use a least 
square method for this purpose. However, as demonstrated 
by Bouchet et al. (1993), it is a good first approximation, 
and in any case the uncertainty in the fitted values will be 
dominated by the strong dependence on the range in s cho- 
sen for the fit. 

We now compare our ^(s) determinations with those of 
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amonds) out to Vmax = 8000 km with previous estimates 
based on optical samples: Stromlo-APM (squares) and CfA2 (tri- 
angles) . 



other authors. We use the ORSd and ORSm correlations for 
that purpose. Power-law fits to these latter are listed in the 
last two lines of Table 0. Our ^(s) for ORSd and ORSm 
are in excellent agreement with the results obtained by de 
Lapparent et al. (1988) and by Loveday et al. (1995) for 
the CfA2 and Stromlo-APM samples, respectively, as shown 
in Fig ^. These previous works probe larger volumes than 
does ORS but contain a smaller number of galaxies, hence 
our smoother ^(s). Notice that the best-fit values to Eq. 
(5) quoted by Loveday et al. (1995) are significantly dif- 
ferent from those listed in Table 0. This is entirely due to 
differences in the scale range used in the fits, which illus- 
trates the sensitivity of the fitted values on the interval in s 
used. Finally, our ORSd and ORSm f(s) are also in excel- 
lent agreement with those determined by Davis & Peebles 
(1983) and Maurogordato, Schaeffer & da Costa (1992) for 
the CfAl and SSRS samples, respectively. This is not very 
surprising given the fact that the UGCm and CfAl overlap 
considerably, as do ESOd and SSRS. 

4.2 ORSd and ORSm 

The correlation function ^(s) for both ORSd and ORSm 
samples can be seen in Fig. |^ and Fig. ^ for Vmax ~ 
4000 km s"'^ and Vmax = 8000 km s~^, respectively. Also 
shown are the equivalent correlations for ESO and UGC 
subsamples. These latter have been offset by a constant for 
clarity. The errorbars again indicate 1 a deviations as deter- 
mined from the 10 bootstrap resamplings. The ORSd and 
ORSm samples within Vmax ~ 4000 km s~^ have signifi- 
cantly different correlation functions. ORSm is more clus- 
tered than ORSd over the entire range of scales available. 
Similar behaviour is seen for both ESO and UGC; the mag- 
nitude limited subsamples are more clustered than their 
diameter-limited counterparts in Fig. ^. Thus, it appears 
that the difference in clustering amplitude between ORSd 
and ORSm arises from similar contributions from the ESO 
and UGC subsamples. On the other hand, for Vmax = 8000 
km s~^, the ORSd and ORSm have very similar ^(s), except 
on the very largest scales, where ORSm falls off faster than 
ORSd (Fig. |). 
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Table 1. Power-law fits and parameters 
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1.65 
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UGCd 


80 


2.5 - 32 


1.48 
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UGCm 


80 


1.6 - 32 


1.64 


6.5 






ORSd 


80 


1 - 40 


1.57 


6.6 


1.59 


7.07 


ORSm 


80 


1.3 - 25 


1.51 


6.8 


1.56 


7.26 
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Figure 6. The correlation function for ORSd (stars) and ORSm 
(diamonds) velocity limited at 4000 km s~^. For comparison we 
also show g(s) for ESOd (plus signs), ESOm (triangles), UGCd 
(crosses) and UGCm (squares). These latter are offset by a factor 
of 20 for the sake of clarity. 
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Figure 7. The same as Fig. |6|, only for samples velocity limited 
at 8000 km s"^ 



The only substantial difference between the ORSm and 
ORSd volumes is the ESGC strip (-17.5° < 5 < -2.5°), 
which is not included in ORSm. Thus it is unlikely that 
the difference in the ORSm and ORSd correlations to 
4000 km is due to large-scale differences in the under- 
lying density field. To make sure, we removed ESGC from 
ORSd and computed ^(s) again. Fig. ^remained essentially 
unchanged; the vast majority of DD, DR and RR pairs in 
fact come from within ESOd, UGCd or from ESOd/UGCd 
cross-boundary pairs. 

The observed discrepancy between ORSm and ORSd in 



Figure 8. The frequency of morphological type for ORSd (dashed 
lines) and ORSm (solid lines) at Vmax = 4000 km s~^(panel a) 
and 4000 < Vmax < 8000 km s'^ (panel b). The difference m mor- 
phological mix is clearly diminished at larger redshifts, rendering 
more similar results for £,{s). 



Fig. ^ is likely to reflect real clustering differences between 
magnitude-limited and diameter-limited samples. Since the 
former tend to pick up, on average, higher surface brightness, 
earlier type galaxies than the latter, one expects ORSm to 
show enhanced clustering as a result of the larger fraction of 
stronger clustered early-type galaxies it contains. The dis- 
tributions of morphological types in ORSd (dashed lines) 
and ORSm (solid lines) are shown in Fig. ^ for Vmax = 4000 
km s"^(panel a) and 4000 < Vmax < 8000 km s~^(panel 
b). As expected, the morphological composition of ORSm is 
clearly skewed towards early-types relative to ORSd. Thus, 
the larger clustering in ORSm seen in Fig. ^ reflects segre- 
gation as a function of morphology and/or surface bright- 
ness. As attested by panel |^ 6, the fraction of late-type, low 
surface brightness galaxies in ORSm and ORSd is markedly 
reduced beyond V ~ 4000 km s~^. This is expected to occur 
in any flux-limited (or diameter-limited) sample since these 
galaxies tend also to be less luminous (and smaller) than 
typical spirals and ellipticals. The reduced difference in the 
morphological mix of ORSm and ORSd for Vmax ~ 8000 
km s~^renders their ^(s) very similar in this larger volume. 
Since morphological segregation is more pronounced in 
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Figure 9. The correlation function for the cluster-removed ORSd 
and ORSm samples velocity limited at Vmax = 4000 km . 



clusters, we test the effect described above by removing the 
6 clusters shown in Table ^ and rederiving ^(s) for both 
ORSm and ORSd out to Vma:c = 4000 km s~\ This list 
contains the most prominent clusters within this volume. 
Columns 1, 2, 3 and 4 list the name and position of each 
cluster. Columns 5 and 6 give the length of the finger-of- 
God and the angular radius assumed to be covered by the 
cluster on the sky. Columns 7 and 8 give the number of 
galaxies in each cluster for ORSm and ORSd. Figure |^ show 
the result of this experiment. The difference in f (s) is now 
clearly reduced (notice the change in the range in ^(s) rela- 
tive to Fig. ^). We thus confirm that ^(s) depends on selec- 
tion criteria as long as small and local volumes, containing 
a significant fraction of low surface brightness/late- types, 
are considered. The presence of high-density regions such 
as clusters may also enhance the dependence on selection 
criteria. This issue had already been addressed by Zucca 
et al. (1991), who found very similar correlation functions 
for magnitude and diameter-limited samples in the northern 
hemisphere, independent of the value used for Vmax- Their 
result, however, was based on volume-limited samples con- 
taining at most 450 galaxies. In addition, the difference in 
morphological mix between their magnitude and diameter- 
limited samples limited at Vmax = 4000 km s~^ is smaller 
than in ORSm and ORSd (see Table |^ of this paper and 
Table 2 of Zucca et al. 1991). 



4.3 Morphological subsamples 

In this section we investigate the existence of morphological 
segregation in ORS. We split each of the five ORS subsam- 
ples compared in section 4.1 into three morphologically se- 
lected subsamples: early-types (T < 0), Sab (1 < T < 4) and 
late- types (T > 5). The number of galaxies in each morpho- 
logical bin for each ORS subsample is given in Table pi We 
separately fit a selection function to each of these 15 morpho- 
logical subsamples using the method described in Paper II 
and then merge the morphological subsamples together to 
create the corresponding ORSd and ORSm morphological 
subsamples. We are then left with 6 morphological subsam- 
ples, 3 for ORSm and 3 for ORSd. 

We thus properly account for the selection effects in- 



Sample 


ESGC 


ESOd 


ESOm 


UGCd 


UGCm 


Early 


141 


267 


475 


223 


470 


Sab 


142 


336 


426 


353 


470 


Late 


317 


542 


404 


681 


617 



Table 4. Power-law fits and parameters for the morphological 
subsamples 



Sample 


Type 


limit 


Range 


7s 


so 






(h-i Mpc) 


(/i-i Mpc) 








Early 


40 


1.3 - 16 


1.57 


5.65 


ORSd 


Sab 


40 


1.3 - 16 


1.53 


4.24 




Late 


40 


1.3 - 16 


1.35 


3.78 




Early 


40 


1.3 - 16 


1.52 


6.70 


ORSm 


Sab 


40 


1.3 - 16 


1.32 


5.63 




Late 


40 


1.3 - 16 


1.17 


4.71 



duced by the dependence of the luminosity (or diameter) 
function on morphology and by non-homogeneities in sam- 
ple selection. In fact, the luminosity function is known to 
vary within the morphological bins defined above (Binggeli, 
Sandage & Tammann 1988, Loveday et al. 1992, Paper II). 
The broad morphological binning chosen in this work re- 
flects the compromise between this dependence and the need 
to use sizeable samples to reliably derive (j){r) and ^(s). It 
also has the advantage of yielding subsamples with roughly 
similar numbers of objects and avoiding systematic differ- 
ences in morphological classification likely to exist among 
the ESO, UGC and ESGC. In fact, it has been suggested 
that spirals may have been misclassified as early-types in 
UGC due to the lower quality plate material used for this 
catalog (Santiago & Strauss 1992). This effect is likely to 
be more pronounced at higher redshifts, and in fact some 
evidence of this is visible in the type-dependent selection 
functions used here. 

Given the increasing sparseness of late-type galaxies 
(T > 5) beyond Vmax = 4000 km s~^, leading to strong 
shot-noise problems, we decided to restrict our morphology- 
clustering investigation to Vmax = 4000 km s~^. Thus, for 
each of the six morphological ORS subsamples we computed 
^(s) using the samples limited at Vmax ~ 4000 km s~^. The 
weighting is again given by the inverse of the new type- 
dependent selection functions, with the appropriate addi- 
tional factors used when combining the ORS subsamples. 
The type correlation functions of ORSd and ORSm are 
shown in Fig. and 0, respectively. Results from power- 
law fits are presented in Table ^. In each case the range in 
scales used was that where ^(s) was well approximated by a 
power- law. 

From the figures and Table ^, we conclude that there is 
strong evidence for morphological segregation out to scales 
of lO/i"^ Mpc in both ORSd and ORSm. We thus confirm 
the results of many previous works, e.g., Lahav & Saslaw 
(1992), Santiago & Strauss (1992), Mo et al. (1992) and 
Loveday et al. (1995). The early- type galaxies are more clus- 
tered than the Sa/Sb galaxies, which in turn are more clus- 
tered than the late-type galaxies. For any given morpho- 
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Table 2. Parameters for removed clusters 
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Figure 10. Correlation functions for three morphologically se- 
lected subsamples of ORSd. Early types: diamonds, Spirals; stars, 
Late types: triangles. 
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Figure 12. The relative biasing between early and late type 
galaxies in the ORSd (panel a) and ORSm (panel b). The dotted 
line is the mean value of b^arly /^Late in each case. 



Figure 11. Same as Fig. |lO| but for the ORSm. 

logical subsample, ^(s) is very similar in both ORSd and 
ORSm. 

By comparing the clustering amplitudes of different 
morphological subsamples we can infer the corresponding 
relative bias factor as a function of scale using the relation 

€,{s)Early = (filiate- The SCale dependence of bEarly/bLate 

is shown in Fig. Il3 for ORSd and ORSm. In spite of the noise 
in both panels of the figure, the two samples lead to quite 
similar results: bEariy/bLate ~ 1-5 on small scales (s ~ 3h~^ 
Mpc), with a declining trend towards larger scales, where 
bEariy/bhate ~ 1. Relative bias factors can also be inferred 
from the fits made by Loveday et al. (1995) to their type- 



dependent 5(s). A relative bias of ~ 1.4 — 1.5 between E/SO 
and Spi/Irr is derived from their data at s ~ 5/i~^ Mpc. 
Removing the clusters listed in Table |^ makes little differ- 
ence to the results. A scale dependence is also suggested by 
the UGC and ESO subsamples considered individually, in- 
dicating that it is not an artifact of the way we combined 
the subsamples or of the weighting scheme used. Such a scale 
dependence is expected from non-linear biasing, since an en- 
hancement in the number of E/SOs relative to late- types in 
small-scale, high density regions such as clusters contributes 
to an enhancement in their relative pair counts on all scales. 
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5 REAL-SPACE CLUSTERING 

The radial velocities are a combination of the Hubble flow 
and peculiar motions. The effect of gravitational peculiar 
motions on small scales is to smear out high-density regions 
such as clusters along the radial line-of-sight. On large scales 
a contraction is expected due to infall into high-density re- 
gions. This gives rise to the well-known (rp, 7r)-diagrams (e.g. 
Fisher et al. 1994, Marzke et al. 1995). In this section we de- 
termine the real-space correlation function, ^(r), in order to 
assess the effect of peculiar motions on the results shown in 
section ^. We essentially follow the prescriptions of Davis & 
Peebles (1983). In brief, we determine the two dimensional 
correlation function ^(rp,7r), where is the separation per- 
pendicular to the line of sight, and tt is the separation along 
the line of sight. Only pairs separated by less than 50° in 
the sky and 50/i~^ Mpc in space were included. As in ^(s), 
we adopted the Ifamilton estimator and no selection effects 
were applied to the random catalogue. 

Given ^(rp,7r) one may obtain the projected function 
w{rp) by integrating (,{rp,n) along the 7r-direction. Once 
w{rp) is found one may determine the full ^(r) by solving 
the Abel integral 

numerically. In Figure |l^ we show our determination of ^(r) 
for the ORSd and ORSm limited at 4000 km s"^ (panel 
a) and 8000 km s^^(panel 6), and compare it to ^(s). 
We use r^^(r) since this is a less variable quantity with 
scale of separation. A number of interesting features are 
present in this plot. First, it confirms our results based 
on redshift space: ORSm is more clustered than ORSd for 
Vmax = 4000 km s~^. As before, this is not the case for the 
samples limited at Vmax ~ 8000 km s~^. Thus, the results 
shown in section 4.2 are not an artificiality produced by red- 
shift distortions. Figure ^ also shows that there are signifi- 
cant differences between ^(r) and ^(s) in all samples: ^(r) is 
larger than ^(s) on small scales, as expected. On large scales, 
linear perturbation theory gives a proportionality between 
^(s) and ^(r), with the proportionality constant a function 
of ;3 = nE! '^/b (Kaiser 1987). The determination of (3 from 
ORS is beyond the scope of this paper, and will be presented 
in a future study. 

Finally, we fitted a power law to ^(r). We derive 4.9 < 
ro < 7.3 and 1.5 < 7r < 1.7 for the samples shown in Fig. |l3[ 
The best fit values for those cut at Vmax = 8000 km s^^ are 
listed in Table |^. 



6 CONCLUSIONS 

Our conclusions are 

(i) There are significant variations in ^(s) among the ORS 
subsamples limited in velocity at Vmax = 4000 km s~^. The 
subsamples limited at Vmax = 8000 km s~^, on the other 
hand, yield very similar clustering patterns, except for the 
ESGC. This suggests that ESOd, UGCd, ESOm and UGCm 
are close approximations to a fair samples, at least for the 
purpose of estimating ^(s) for s ~ 20/i~^ Mpc. This is 
further demonstrated by the very nice agreement found be- 
tween ^(s) as determined here and those obtained from pre- 



vious optical surveys covering difi'erent volumes (Davis & 
Peebles 1983, de Lapparent et al. 1988, Maurogordato et al. 
1992, Loveday et al. 1995). We thus estimate that a sam- 
pling volume of ~ (75/i~^Mpc)'' is enough to represent the 
low-order clustering properties of the ensemble. 

(ii) We have investigated the clustering properties of 
galaxies as a function of selection criteria and concluded 
that significant differences in ^(s) arise between diameter- 
limited and magnitude- limited samples only for small and lo- 
cal volumes, where diameter-limited samples contain a larger 
fraction of low-luminosity, late-type galaxies. For larger 
volumes, both samples are dominated by more luminous, 
earlier-type objects, reducing the discrepancy in ^(s). These 
results are largely in agreement with Zucca et al. (1991), 
who used smaller samples. 

(iii) We confirm the existence of morphological segrega- 
tion out to scales of ~ 10h~^ Mpc as previously found 
by other authors (Lahav & Saslaw 1992, Mo et al. 1992, 
Dominguez-Tenreiro et al. 1994, Loveday et al. 1995). Early- 
type galaxies (—5 < T < 0) are more strongly clustered 
than late-types (T > 5), whereas Sa/Sb galaxies form 
an intermediate group. The difference in clustering ampli- 
tudes between E/SOs and Late- types suggests that the rel- 
ative bias factor between these two morphological types 
is weakly dependent on scale. Our results indicate that 
bEariy/bLate ~ 1.5 ou Small scales, showing a declining trend 
towards bEariy/bhate 1 ou larger scales (s ~ lQh~^ Mpc). 
Excluding clusters from the morphological samples does not 
change the overall picture; segregation is still present out to 
s ~ 10/i~^ Mpc and the apparent scale dependence of the 
relative bias factor remains. If real, this scale-dependence is 
in contradiction with the popular 'linear local biasing' pre- 
scription, in which bsariy and bLate are constants. It would 
also disagree with the linear limit of the biasing scheme pro- 
posed by Kaiser (1984). Such a scale dependence may be 
explained if we allow for non-linear biasing, e.g. by having a 
higher fraction of early- types formed in high-density regions. 

(iv) We compute the correlation function ^{rp,n) and 
from it, find ^(r), the real space correlation function. This 
experiment confirms our previous findings in redshift space: 
ORSm galaxies are more clustered than those in ORSd 
within Vmax = 4000 km s^^but not within Vmax ~ 8000 
km s~^. ^(s) is also shallower and of smaller amplitude than 
^(r) on small scales, as expected. 

(v) Our qualitative results are insensitive to the particu- 
lar kind of estimator used in deriving ^(s) and to corrections 
for peculiar velocities. They are also robust to uncertainties 
in the selection functions derived for ORS or to the particu- 
lar weighting scheme used to compensate for selection effects 
or to combine the different subsamples. 

An interesting question is whether the results found in 
this work hold when higher order clustering statistics are 
used. These latter are also important as means of testing 
scale- invariance relations (Bouchet et al. 1993). The deter- 
mination of counts in cells and high order correlations is in 
progress and will be shown in a forthcoming paper (Hermit 
et al. 1996, in preparation). 
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Figure 13. The real space correlation function r^^(r) plotted against separation r for the ORSd (stars) and ORSm (diamonds) limited 
at 4000 km s~^ (panel a) and 8000 km (panel b). Shown is also the redshift space correlation function, s^^(s) for ORSd (dashed 
line) and ORSm (solid line). 
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